13 Essential SOL 
Commands 


This a cheat sheet for 
the most common 
SQL commands 

| use as a data 


analyst. 


The example dataset 
shows game results for 
NCAA basketball 


tournaments. 


1. SELECT 


Select the columns you want. 


SELECT 
season 
, round 
, Win seed 
, Win school ncaa 
, Win pts 
, lose seed 
, lose school ncaa 
, lose pts 
, num ot 
FROM 'examples-378922.ncaa.basketball' 


Row season round win seed win school ncaa win pts lose seed lose school ncaa lose pts num ot 
4 4 4 4 4 4 4 4 4 4 


1 1989 64 Evansville 94 06 Oregon St. 90 1 
1997 32 Chattanooga 75 06 Illinois 63 
1997 64 Chattanooga 73 03 Georgia 70 
1994 64 Temple 61 13 Drexel 39 
1985 64 Temple 60 09 Virginia Tech 57 
1991 64 Temple 80 07 Purdue 63 
1988 32 Temple 74 08 Georgetown 53 
1999 Temple 77 10 Purdue 55 


2. WHERE 


Filter your data based 
on any criteria. 


SELECT 


* 


FROM 'examples-378922.ncaa.basketball' 
WHERE season BETWEEN 1990 and 1999 
AND round <= 16 


3. ORDER BY 


Sort your data any way 
you want. Y f 


SELECT 


* 


FROM 'examples-378922.ncaa.basketball' 
ORDER BY season, round desc 


season round days from epoch game date win seed win region win market 
a 4 4 a 4 


és 


— 


1985 64 5551 1985-03-14 Thursday 08 Temple 


1985 64 5552 1985-03-15 Friday 02 Virginia Commo 


1985 64 5551 1985-03-14 Thursday 01 Georgetown 


1985 64 5552 1985-03-15 Friday 11 Boston College 


1985 64 5552 1985-03-15 Friday 11 UTEP 


1985 64 5552 1985-03-15 Friday 07 Syracuse 
1985 64 5551 1985-03-14 Thursday 04 UNLV 
1985 64 5551 1985-03-14 Thursday 11 Auburn 


1985 64 5552 1985-03-15 Friday 02 Memphis 


2 
3 
4 
9 
6 
7 
8 
9 
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— 


1985 64 5552 1985-03-15 Friday 07 UAB 


— 
— 


1985 64 5551 1985-03-14 Thursday 01 Oklahoma 


— 
N 


1985 64 5551 1985-03-14 Thursday 04 Ohio State 
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1985 64 5552 1985-03-15 Friday 06 Georgia 


4. GROUP BY 


Agsregate data based 
on a calculation. 


SELECT 
season 
, count(*) as num_games 
, sum(num_ot) as total_overtimes 
, max(win_pts - lose_pts) as biggest_victory 


FROM 'examples-378922.ncaa.basketball' 
GROUP BY season 


season num. games total overtimes biggest victory 
4 4 4 4 4 


1985 63 3 25 
1986 63 49 
1987 63 34 
1988 63 40 
1989 63 43 
1990 63 39 
1991 63 41 


5. HAVING 


Similar to WHERE but for 
aggregate functions. 


SELECT 
season 
, count(*) as num games 
, Sum(num ot) as total overtimes 
, max(win pts - lose pts) as biggest victory 
FROM 'examples-378922.ncaa.basketball' 
GROUP BY season 
HAVING max(win pts - lose pts) > 40 


season num games total overtimes biggest victory 
4 4 4 4 


63 3 43 
63 41 
63 41 
67 47 
63 49 
67 42 
64 43 
63 45 


6. CASE WHEN 


Add If/Else logic to a query. 


SELECT 
season 
win_school_ncaa 
win_pts 
lose_school_ncaa 
lose_pts 
win_pts - lose_pts as win_margin 


case 
when win_pts - lose_pts >= 20 
then 'blowout' 
when win_pts - lose_pts <= 3 
then 'close game' 
else 'normal' end as result type 


FROM 'examples-378922.ncaa.basketball' 


season P win_school_ncaa , win pts ? lose school ncaa P lose_pts ; win margin F result_type 
1996 Georgia Tech 103 Boston College 89 normal 
1995 UCLA 102 UConn 96 normal 
1990 UNLV 102 Little Rock 72 blowout 
2009 Missouri 102 Memphis 91 normal 
2004 UAB 102 Washington 100 close game 
1988 lowa 102 Florida St. 98 normal 
1994 Kansas 102 Chattanooga 73 blowout 
2012 Kentucky 102 Indiana 90 normal 
1989 NC State 102 lowa 96 normal 
1990 Texas 102 Xavier 89 normal 
2011 North Carolina 102 LIU Brooklyn 87 normal 
1989 Michigan 102 Virginia 65 blowout 
1992 Michigan 102 ETSU 90 normal 


/. IN 


Filter by a list of items. 


SELECT 
* 


FROM 'examples-378922.ncaa.basketball' 
WHERE win. school ncaa in ('Kansas', Michigan ', 'Duke') 


8. LIKE 


Returns true If data 
matches a text pattern. 


| SELECT 


DISTINCT win school ncaa 
FROM 'examples-378922.ncaa.basketball' 
WHERE win school ncaa like '%lexas%' 


Row win school ncaa 


1 Texas A&M 
2 Texas 


Texas Tech 


9. LEFT JOIN 


Join together data from 
multiple tables. 


-- Get Winning Team Colors 

SELECT 
b.game_date 
, b.win_school_ncaa 
BOT 

FROM "examples-378922.ncaa.basketball b 

LEFT JOIN bigquery-public-data.ncaa basketball.team colors c 
on b.win team id = c.id 


game date win school ncaa color 
PA 


1985-03-14 Temple #9e1b34 
1985-03-14 Georgetown #00275d 
1985-03-14 UNLV #cc092f 
1985-03-14 Auburn #f58025 
1985-03-14 Oklahoma #a80532 
1985-03-14 Ohio St. #bb0000 
1985-03-14 Louisiana Tech #002f8b 
1985-03-14 Kansas #006ab5 


10. UNION ALL 


Append data together. 


-- Combine Regular Season & Tournament Games 
-- **Columns need to be the same. 


-- Regular Season 
SELECT 
season 


, name as win team 


, points game as win team pts 

, Opp name as lose team 

, Opp points game as lose team pts 
FROM ^"bigquery-public-data.ncaa basketball.mbb historical teams games" 
WHERE win = true 


UNION ALL 


-- Tournament 
SELECT 
season 
, Win name 
, Win pts 
, lose name 
, lose pts 
FROM bigquery-public-data.ncaa basketball.mbb historical tournament games" 


11. CTEs 


Write clean SQL. 


with winning margin as ( 
SELECT 
* 
| , Win pts - lose pts as win margin 
FROM 'examples-378922.ncaa.basketball' 


SELECT * 
FROM winning. margin 
WHERE win margin > 20 


12. WINDOW FUNCTIONS 


Perform calculations 


aCross TOWS. 


SELECT 


game date 
win name 
lose name 
win pts + lose pts as total points 
ROW NUMBER() OVER(PARTITION BY win name ORDER BY game date) AS team game num 


SuM(win pts+lose pts) OVER (ORDER BY game date, win team id) AS running sum pts 


FROM 'examples-378922.ncaa.basketball' 


— 
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game date 
1985-03-14 
1985-03-14 
1985-03-14 
1985-03-14 
1985-03-14 
1985-03-14 
1985-03-14 
1985-03-14 
1985-03-14 
1985-03-14 


win name 
Tigers 
Wildcats 
Mustangs 
Redbirds 
Ramblers 


Rebels 


Fighting Irish 


Buckeyes 
Owls 


Hoyas 


lose_name 
Boilermakers 
Huskies 
Monarchs 
Trojans 
Gaels 

Aztecs 
Beavers 
Cyclones 
Hokies 


Mountain Hawks 


total points — team game num running sum pts 
4 4 4 


117 
124 
153 
113 
117 
165 
149 
139 
117 
111 


1 
1 
1 


117 
241 
394 
507 
624 
789 
938 
1077 
1194 
1305 


13. DATE FUNCTIONS 


Work with dates and times. 


SELECT 
game 


_date 


, extract(day from game date) as day of month 
, current date() as today 


, date add(current. date(), 


, date trunc(game date, month) as first of month 
, last day(game date) as end of month 
, date diff(last day(game date),game date, day) as days left in month 
FROM 'examples-378922.ncaa.basketball' 


— 


2 
3 
4 
5 
6 
7 
8 
9 
0 


1 


game. date 
1989-03-17 
1997-03-16 
1997-03-14 
1994-03-18 
1985-03-14 
1991-03-14 
1988-03-20 
1999-03-19 
1999-03-14 
1988-03-24 


day_of_mı today 


17 
16 
14 
18 
14 
14 
20 
19 
14 
24 


2023-02-26 
2023-02-26 
2023-02-26 
2023-02-26 
2023-02-26 
2023-02-26 
2023-02-26 
2023-02-26 
2023-02-26 
2023-02-26 


today_plus_3 y 


2023-03-01 
2023-03-01 
2023-03-01 
2023-03-01 
2023-03-01 
2023-03-01 
2023-03-01 
2023-03-01 
2023-03-01 
2023-03-01 


first of month, 


1989-03-01 
1997-03-01 
1997-03-01 
1994-03-01 
1985-03-01 
1991-03-01 
1988-03-01 
1999-03-01 
1999-03-01 
1988-03-01 


interval 3 day) as today_plus_3 


end of month days left ii 
4 4 


1989-03-31 
1997-03-31 
1997-03-31 
1994-03-31 
1985-03-31 
1991-03-31 
1988-03-31 
1999-03-31 
1999-03-31 
1988-03-31 


14 
15 
17 
13 
17 
17 
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Like this cheat sheet? 


You'll love my BigQuery 
Jumpstart course. 


You learn how to use 
BigQuery and access 
hundreds of free 
public datasets. 


Link in bio. 


